Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features

نویسندگان

  • Nizar Habash
  • Ryan Gabbard
  • Owen Rambow
  • Seth Kulick
  • Mitchell P. Marcus
چکیده

This paper discusses automatic determination of case in Arabic. This task is a major source of errors in full diacritization of Arabic. We use a gold-standard syntactic tree, and obtain an error rate of about 4.2%, with a machine learning based system outperforming a system using hand-written rules. A careful error analysis suggests that when we account for annotation errors in the gold standard, the error rate drops to 0.8%, with the hand-written rules outperforming the machine learning-based system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Role of Non-Linguistic Variables in Production of Complex Linguistic Structures by Hearing-Impaired Children

Objectives: Language development is often very slower in hearing impaired children compared with their normal peers. Hearing impairment during childhood affects all aspects of speech production and language acquisition. It seems that hearing impaired people suffer from language and speech impairments such as production of complex linguistic structures. The purpose of this study is to determine ...

متن کامل

The Effects of Task Complexity on Input-Driven Uptake of Salient Linguistic Features

The present study investigated the effects of cognitive complexity of pedagogical tasks on the learners’ uptake of salient features in the input. For the purpose of data collection, three versions of a decision-making task (simple, mid, and complex) were employed. Three intact classes (each 20 language learners) were randomly assigned to three groups.  Each group transacted a version of a decis...

متن کامل

Bourdieu and Genette in Paratext: How Sociology Counts in Linguistic Reasoning

While Bourdieu’s theory of practice provides an ensemble of conceptual tools which analyze patterns of social life that are irreducible to the limiting view of individuals as free-acting agents, Genette’s paratextual theory offers the metalanguage necessary to account for the microcosm of paratext as a linguistic space. This study takes issue with unidirectional approaches to researching parate...

متن کامل

Perception Development of Complex Syntactic Construction in Children with Hearing Impairment

Objectives: Auditory perception or hearing ability is critical for children in acquisition of language and speech hence hearing loss has different effects on individuals’ linguistic perception, and also on their functions. It seems that deaf people suffer from language and speech impairments such as in perception of complex linguistic constructions. This research was aimed to study the pe...

متن کامل

Constitutive Features of the Russian Political Discourse in Ecolinguistic Aspect

The article offers a comparative description of typological mechanisms used in political communicative practice and methods of verbal explication of its axiological and symbolic constituents determining universal mental features of individual/collective consciousness. The research position based on a systemic multilevel analysis of the component structure of discourse facilitates the identifica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007